OCR Based Thresholding
Abstract
In large-scale digitization processes, several common tasks are performed to produce an electronic version of a paper document. One of the first steps is thresholding the image, which is necessary for the subsequent procedures to work properly. Many binarization methods have been proposed to solve this problem, but they need to be tuned on the target document corpus to obtain the best results. In this paper, we introduce a fully automatic thresholding method for printed document analysis. The purpose is to obtain the most suitable binarizer for a given document image according to the quality of the output of an OCR system. Tuning can be done either on a full page or on sample text lines extracted from a page image. As opposed to existing methods, the tuning is directly goal-directed and depends neither on subjective visual evaluation nor on non-representative performance criteria. We demonstrate the effectiveness of this approach on a subset of 740 pages from the Google 1000 Books dataset. Results show that choosing the right binarizer parameters with the Recognition Driven Thresholding (RDT) method can reduce the words-in-dictionary error rate of an OCR system by 6%.
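The selection loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract names neither the binarizer family nor the OCR engine, so the sketch assumes a simple global threshold as the only tunable parameter and takes the OCR step as a caller-supplied function. The names `binarize`, `out_of_dictionary_rate`, `pick_threshold`, and the `ocr` callable are all hypothetical.

```python
from typing import Callable, Iterable, List, Optional, Set, Tuple

Image = List[List[int]]  # grayscale pixels, 0 (black) to 255 (white)


def binarize(gray: Image, threshold: int) -> Image:
    """Global thresholding (an assumed binarizer): pixels below the
    threshold become ink (0), all others become paper (255)."""
    return [[0 if px < threshold else 255 for px in row] for row in gray]


def out_of_dictionary_rate(text: str, dictionary: Set[str]) -> float:
    """Fraction of recognized words that are not found in the dictionary.
    Returns 1.0 when the OCR output contains no words at all."""
    words = [w.strip(".,;:!?\"'()").lower() for w in text.split()]
    words = [w for w in words if w]
    if not words:
        return 1.0
    misses = sum(1 for w in words if w not in dictionary)
    return misses / len(words)


def pick_threshold(
    gray: Image,
    candidates: Iterable[int],
    ocr: Callable[[Image], str],  # hypothetical OCR hook supplied by the caller
    dictionary: Set[str],
) -> Tuple[int, float]:
    """Try every candidate threshold, run OCR on the binarized image, and
    keep the threshold with the lowest out-of-dictionary word rate."""
    best: Optional[Tuple[int, float]] = None
    for t in candidates:
        rate = out_of_dictionary_rate(ocr(binarize(gray, t)), dictionary)
        if best is None or rate < best[1]:
            best = (t, rate)
    if best is None:
        raise ValueError("no candidate thresholds given")
    return best
```

In practice `gray` could be a full page or a few sample text lines, as the abstract suggests, and `ocr` would wrap whatever recognition engine is in use. Scoring against a dictionary requires no ground-truth transcription, which is what allows the tuning to run automatically.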
Similar resources
Adaptive pre-OCR cleanup of grayscale document images
This paper describes new capabilities of ImageRefiner, an automatic image enhancement system based on machine learning (ML). ImageRefiner was initially designed as a pre-OCR cleanup filter for bitonal (black-and-white) document images. Using a single neural network, ImageRefiner learned which image enhancement transformations (filters) were best suited for a given document image and a given OCR...
Binarising Camera Images for OCR
In this paper we describe a new binarisation method designed specifically for OCR of low quality camera images: Background Surface Thresholding or BST. This method is robust to lighting variations and produces images with very little noise and consistent stroke width. BST computes a "surface" of background intensities at every point in the image and performs adaptive thresholding based on this ...
Optimal Parameter Selection Technique for a Neural Network Based Local Thresholding Method
Thresholding a given image into a binary image is a necessary step for most image analysis and recognition techniques. In document recognition applications, the success of OCR depends largely on the quality of the thresholded image. Non-uniform illumination, low contrast, and complex backgrounds make thresholding challenging in this application. In this paper, selection of optimal parameters for Neural...
Denoising of Document Images using Discrete Curvelet Transform for OCR Applications
In this paper, a denoising and binarization scheme of document images corrupted by white Gaussian noise and Impulse noise is presented using Curvelet Transform. The ability of sparse representation and edge preservation of Curvelet transform is utilized. Impulse noise gets added during document scanning or after binarization of scanned document images. White Gaussian noise corrupts the document...
Document Image Binarization Using Retinex and Global Thresholding
Document images are usually degraded in the course of photocopying, faxing, printing, or scanning. These degradations seem negligible to human eyes but can cause an abrupt decline in the accuracy of the current generation of optical character recognition (OCR) systems. In this paper we present a binarization method based on retinex theory followed by a global threshold. The proposed...
Publication date: 2009